Search results for "generalized linear models"

showing 10 items of 14 documents

Differential geometric LARS via cyclic coordinate descent method

2012

We address the problem of how to compute the coefficient path implicitly defined by the differential geometric LARS (dgLARS) method in a high-dimensional setting. Although the geometrical theory developed to define the dgLARS method does not need of the definition of a penalty function, we show that it is possible to develop a cyclic coordinate descent algorithm to compute the solution curve in a high-dimensional setting. Simulation studies show that the proposed algorithm is significantly faster than the prediction-corrector algorithm originally developed to compute the dgLARS solution curve.

Cyclic coordinate descent method Differential geometry dgLARS Generalized linear models LARS Sparse models Variable selectionSettore SECS-S/01 - Statistica
researchProduct

Dual Extrapolation for Sparse Generalized Linear Models

2020

International audience; Generalized Linear Models (GLM) form a wide class of regression and classification models, where prediction is a function of a linear combination of the input variables. For statistical inference in high dimension, sparsity inducing regularizations have proven to be useful while offering statistical guarantees. However, solving the resulting optimization problems can be challenging: even for popular iterative algorithms such as coordinate descent, one needs to loop over a large number of variables. To mitigate this, techniques known as screening rules and working sets diminish the size of the optimization problem at hand, either by progressively removing variables, o…

FOS: Computer and information sciencesComputer Science - Machine Learningextrapolation[MATH.MATH-OC] Mathematics [math]/Optimization and Control [math.OC]Machine Learning (stat.ML)working setsgeneralized linear models[STAT.ML] Statistics [stat]/Machine Learning [stat.ML]Convex optimizationscreening rulesMachine Learning (cs.LG)[STAT.ML]Statistics [stat]/Machine Learning [stat.ML]Statistics - Machine Learning[MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]Lassosparse logistic regression
researchProduct

Implicit differentiation for fast hyperparameter selection in non-smooth convex learning

2022

International audience; Finding the optimal hyperparameters of a model can be cast as a bilevel optimization problem, typically solved using zero-order techniques. In this work we study first-order methods when the inner optimization problem is convex but non-smooth. We show that the forward-mode differentiation of proximal gradient descent and proximal coordinate descent yield sequences of Jacobians converging toward the exact Jacobian. Using implicit differentiation, we show it is possible to leverage the non-smoothness of the inner problem to speed up the computation. Finally, we provide a bound on the error made on the hypergradient when the inner optimization problem is solved approxim…

FOS: Computer and information sciencesbilevel optimizationComputer Science - Machine Learninghyperparameter selec- tionMachine Learning (stat.ML)[MATH.MATH-OC] Mathematics [math]/Optimization and Control [math.OC]generalized linear modelsMachine Learning (cs.LG)Convex optimizationStatistics - Machine Learning[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]Optimization and Control (math.OC)FOS: Mathematics[MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]hyperparameter optimizationLassoMathematics - Optimization and Control[MATH.MATH-ST] Mathematics [math]/Statistics [math.ST]
researchProduct

Model averaging estimation of generalized linear models with imputed covariates

2015

a b s t r a c t We address the problem of estimating generalized linear models when some covariate values are missing but imputations are available to fill-in the missing values. This situation generates a bias-precision trade- off in the estimation of the model parameters. Extending the generalized missing-indicator method proposed by Dardanoni et al. (2011) for linear regression, we handle this trade-off as a problem of model uncertainty using Bayesian averaging of classical maximum likelihood estimators (BAML). We also propose a block model averaging strategy that incorporates information on the missing-data patterns and is computationally simple. An empirical application illustrates our…

Generalized linear modelEconomics and EconometricsApplied MathematicsSettore SECS-P/05 - EconometriaEstimatorMissing dataGeneralized linear mixed modelModel averaging Bayesian averaging of maximum likelihood destimators Generalized linear models Missing covariates Generalized missing-indicator method shareHierarchical generalized linear modelStatisticsLinear regressionCovariateApplied mathematicsGeneralized estimating equationMathematics
researchProduct

Weighted-average least squares estimation of generalized linear models

2018

The weighted-average least squares (WALS) approach, introduced by Magnus et al. (2010) in the context of Gaussian linear models, has been shown to enjoy important advantages over other strictly Bayesian and strictly frequentist model averaging estimators when accounting for problems of uncertainty in the choice of the regressors. In this paper we extend the WALS approach to deal with uncertainty about the specification of the linear predictor in the wider class of generalized linear models (GLMs). We study the large-sample properties of the WALS estimator for GLMs under a local misspecification framework that allows the development of asymptotic model averaging theory. We also investigate t…

Generalized linear modelEconomics and EconometricsGeneralized linear modelsBayesian probabilityGeneralized linear modelSettore SECS-P/05 - EconometriaLinear predictionContext (language use)01 natural sciencesLeast squares010104 statistics & probabilityWALS; Model averaging; Generalized linear models; Monte Carlo; AttritionFrequentist inference0502 economics and businessAttritionEconometricsApplied mathematicsStatistics::Methodology0101 mathematicsMonte Carlo050205 econometrics MathematicsWALSApplied Mathematics05 social sciencesLinear modelEstimatorModel averaging
researchProduct

Using the dglars Package to Estimate a Sparse Generalized Linear Model

2015

dglars is a publicly available R package that implements the method proposed in Augugliaro et al. (J. R. Statist. Soc. B 75(3), 471-498, 2013) developed to study the sparse structure of a generalized linear model (GLM). This method, called dgLARS, is based on a differential geometrical extension of the least angle regression method. The core of the dglars package consists of two algorithms implemented in Fortran 90 to efficiently compute the solution curve. dglars is a publicly available R package that implements the method proposed in Augugliaro et al. (J. R. Statist. Soc. B 75(3), 471-498, 2013) developed to study the sparse structure of a generalized linear model (GLM). This method, call…

Generalized linear modelFortranLeast-angle regressionGeneralized linear array modelFeature selectionSparse approximationdgLARS generalized linear models sparse models variable selectionGeneralized linear mixed modelSettore SECS-S/01 - StatisticacomputerGeneralized estimating equationAlgorithmMathematicscomputer.programming_language
researchProduct

Understanding german fdi in latin america and asia: a comparison of glm estimators

2020

The growth of Foreign Direct Investment (FDI) in developing countries over the last decade has attracted an intense academic and policy-oriented interest for its determinants. Despite the gravity model being considered a useful tool to approximate bilateral FDI flows, the literature has seen a growing debate in relation to its econometric specification, so that which is the best estimator for the gravity equation is far from conclusive. This paper examines the determinants of German outward FDI in Latin America and Asia for the period 1996-2012 by evaluating the performance of alternative Generalized Linear Model (GLM) estimators. Our findings indicate that Negative Binomial Pseudo Maximum …

Generalized linear modelLatin Americansfdi determinantsEconomics Econometrics and Finance (miscellaneous)gravity modelsNegative binomial distributionDeveloping countryForeign direct investmentDevelopmentgermany:CIENCIAS ECONÓMICAS [UNESCO]German0502 economics and businessddc:330EconometricsEconomicsC13050207 economicsC33050208 financelcsh:HB71-7405 social sciencesEstimatorlcsh:Economics as a scienceUNESCO::CIENCIAS ECONÓMICASgeneralized linear modelslanguage.human_languageGravity model of tradelanguageF21F23outward foreign direct investment
researchProduct

Influence of environmental factors on the spatial distribution and diversity of forest soil in Latvia

2012

This study was carried out to determine the spatial relationships between environmental factors (Quaternary deposits, topographical situation, land cover, forest site types, tree species, soil texture) and soil groups, and their prefix qualifiers (according to the international Food and Agricultural Organization soil classification system World Reference Base for Soil Resources [FAO WRB]). The results show that it is possible to establish relationships between the distribution of environmental factors and soil groups by applying the generalized linear models in data statistical analysis, using the R 2.11.1 software for processing data from 113 sampling plots throughout the forest terri…

Soil mapRegosolforest typeSoil textureEcologylcsh:QE1-996.5Soil classificationLand coverlcsh:GeologyGeographySoil seriesgeneralized linear models.Unified Soil Classification SystemWorld Reference Base for Soil ResourcesGeneral Earth and Planetary SciencesPhysical geographyQuaternary depositsFAO WRB classificationWater Science and TechnologyEstonian Journal of Earth Sciences
researchProduct

dglars: An R Package to Estimate Sparse Generalized Linear Models

2014

dglars is a publicly available R package that implements the method proposed in Augugliaro, Mineo, and Wit (2013), developed to study the sparse structure of a generalized linear model. This method, called dgLARS, is based on a differential geometrical extension of the least angle regression method proposed in Efron, Hastie, Johnstone, and Tibshirani (2004). The core of the dglars package consists of two algorithms implemented in Fortran 90 to efficiently compute the solution curve: a predictor-corrector algorithm, proposed in Augugliaro et al. (2013), and a cyclic coordinate descent algorithm, proposed in Augugliaro, Mineo, and Wit (2012). The latter algorithm, as shown here, is significan…

Statistics and ProbabilityGeneralized linear modelEXPRESSIONMathematical optimizationTISSUESFortrancyclic coordinate descent algorithmdgLARSFeature selectionDANTZIG SELECTORpredictor-corrector algorithmLIKELIHOODLEAST ANGLE REGRESSIONsparse modelsDifferential (infinitesimal)differential geometrylcsh:Statisticslcsh:HA1-4737computer.programming_languageMathematicsLeast-angle regressionExtension (predicate logic)Expression (computer science)generalized linear modelsBREAST-CANCER RISKVARIABLE SELECTIONDifferential geometrydifferential geometry generalized linear models dgLARS predictor-corrector algorithm cyclic coordinate descent algorithm sparse models variable selection.MARKERSHRINKAGEStatistics Probability and UncertaintyHAPLOTYPESSettore SECS-S/01 - StatisticacomputerAlgorithmSoftware
researchProduct

Extended differential geometric LARS for high-dimensional GLMs with general dispersion parameter

2018

A large class of modeling and prediction problems involves outcomes that belong to an exponential family distribution. Generalized linear models (GLMs) are a standard way of dealing with such situations. Even in high-dimensional feature spaces GLMs can be extended to deal with such situations. Penalized inference approaches, such as the $$\ell _1$$ or SCAD, or extensions of least angle regression, such as dgLARS, have been proposed to deal with GLMs with high-dimensional feature spaces. Although the theory underlying these methods is in principle generic, the implementation has remained restricted to dispersion-free models, such as the Poisson and logistic regression models. The aim of this…

Statistics and ProbabilityGeneralized linear modelMathematical optimizationGeneralized linear modelsPredictor-€“corrector algorithmGeneralized linear model02 engineering and technologyPoisson distributionDANTZIG SELECTOR01 natural sciencesCross-validationHigh-dimensional inferenceTheoretical Computer Science010104 statistics & probabilitysymbols.namesakeExponential familyLEAST ANGLE REGRESSION0202 electrical engineering electronic engineering information engineeringApplied mathematicsStatistics::Methodology0101 mathematicsCROSS-VALIDATIONMathematicsLeast-angle regressionLinear model020206 networking & telecommunicationsProbability and statisticsVARIABLE SELECTIONEfficient estimatorPredictor-corrector algorithmComputational Theory and MathematicsDispersion paremeterLINEAR-MODELSsymbolsSHRINKAGEStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaStatistics and Computing
researchProduct